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Abstract 

Numerous empirical proofs indicate the adequacy of the time discrete auto-regressive stochas- 
tic volatility models introduced by Taylor |Tay86[ |Tay05| in the description of the log-returns of 
financial assets. The pricing and hedging of contingent products that use these models for their 
underlying assets is a non-trivial exercise due to the incomplete nature of the corresponding mar- 
ket. In this paper we apply two volatility estimation techniques available in the literature for these 
models, namely Kalman filtering and the hierarchical-likelihood approach, in order to implement 
various pricing and dynamical hedging strategies. Our study shows that the local risk minimization 
scheme developed by FoUmer, Schweizer, and Sondermann is particularly appropriate in this setup, 
especially for at and in the money options or for low hedging frequencies. 

Keywords: Stochastic volatility models, ARSV models, hedging techniques, incomplete markets, local 
risk minimization, Kalman filter, hierarchical-likelihood. 

1 Introduction 

Ever since Black, Merton, and Scholes introduced their celebrated option valuation formula |BS72[ 
IMer76| a great deal of effort has been dedicated to reproduce that result using more realistic stochas- 
tic models for the underlying asset. In the discrete time modeling setup, the GARCH parametric 
family |Eng82[ IBol86| has been a very popular and successful choice for which the option pricing and 
hedging problem has been profusely studied; see for example |Dua951 iHNOOl IBEK+Ill \GCim [QrtTf)] , 
and references therein. Even though the GARCH specification can accommodate most stylized features 
of financial returns like leptokurticity, volatility clustering, and autocorrelation of squared returns, there 
are mathematical relations between some of their moments that impose undesirable constraints on some 
of the parameter values. For example, a well-known phenomenon |CPnR04] has to do with the rela- 
tion that links the kurtosis of the process with the parameter that accounts for the persistence of the 
volatility shocks (the sum of the ARCH and the GARCH parameters, bound to be smaller than one in 
order to ensure stationarity) . This relation implies that for highly volatility persistent time series like 
the ones observed in practice, the ARCH coefficient is automatically forced to be very small, which is in 
turn incompatible with having sizeable autocorrelation (ACF) for the squares (the ACF of the squares 
at lag 1 is linear in this coefficient). This situation aggravates when innovations with fat tails are used 
in order to better reproduce leptokurticity. 

The structural rigidities associated to the finiteness of the fourth moment are of particular importance 
when using quadratic hedging methods for there is an imperative need to remain in the category of 
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square summable objects and finite kurtosis is a convenient way to ensure that (see, for example |OrtlO[ 
Theorem 3.1 (iv)]). 

The auto-regressive stochastic volatility (ARSV) models |Tay86[ |Tay05| that we will carefully de- 
scribe later on in Section [2] are a parametric family designed in part to overcome the difficulties that 
we just explained. The defining feature of these models is the fact that the volatility (or a function of 
it) is determined by an autoregressive process that, unlike the GARCH situation, is exclusively driven 
by past volatility values and by designated innovations that are in principle independent from those 
driving the returns. The use of additional innovations introduces in the model a huge structural leeway 
because, this time around, the same constraint that ensures stationarity guarantees the existence of 
finite moments of arbitrary order which is of much value in the context of option pricing and hedging. 

The modifications in the ARSV prescription that allow an enriching of the dynamics come with a 
price that has to do with the fact that conditional volatilities are not determined, as it was the case 
in the GARCH situation, by the price process. This causes added difficulties at the time of volatility 
and parameter estimation; in particular, a conditional likelihood cannot be written down in this context 



(see later on in Section 2.1 1. This also has a serious impact when these models are used in derivatives 
pricing and hedging because the existence of additional noise sources driving the dynamics makes more 
acute the incomplete character of the corresponding market; this makes more complex the pricing and 
hedging of contingent products. 

The main goal of this paper is showing that appropriate implementations of the local risk minimization 
scheme developed by Fdllmer, Schweizer, and Sondermann {FSSR \FS91l \Sch01f tailored to the ARSV 
setup provide an appropriate tool for pricing and hedging contingent products that use these models 
to characterize their underlying assets. The realization of this hedging scheme in the ARSV context 
requires two main tools: 

• Use of a volatility estimation technique: as we already said, volatility for these models is 
not determined by the price process and hence becomes a genuinely hidden variable that needs 
to be estimated separately. In our work we will use a Kalman filtering approach advocated 
by |HRS94) and the so called hierarchical likelihood (/i-likelihood) strategy |LN96[ ILN06[ 
IdCLOSi IWLLdCTT] which, roughly speaking, consists of carrying out a likelihood estimation while 
considering the volatilities as unobserved parameters that are part of the optimization problem. 
Even though both methods are adequate volatility estimation techniques, the /i-likelihood tech- 
nique has a much wider range of applicability for it is not subjected to the rigidity of the state 
space representation necessary for Kalman and hence can be used for stochastic volatility models 
with complex link functions or when innovations are non-Gaussian. 

• Use of a pricing kernel: this term refers to a probability measure equivalent to the physical one 
with respect to which the discounted prices are martingales. The expectation of the discounted 
payoff of a derivative product with respect to any of these equivalent measures yields an arbitrage 
free price for it. In our work we will also use these martingale measures in order to devise 
hedging strategies via local risk minimization. This is admittedly just an approximation of the 
optimal local risk minimization scheme that should be carried out with respect to the physical 
measure that actually quantifies the local risk. However there are good reasons that advise the 
use of these equivalent martingale measures mainly having to do with numerical computability 
and interpretation of the resulting value processes as arbitrage free prices. These arguments are 
analyzed in Section [3. 2| 



Regarding the last point, there are two equivalent martingale measures that we will be using. The 
first one is inspired by the Extended Girsanov Principle introduced in [EM98j . This principle allows 
the construction of a martingale measure in a general setup under which the process behaves as its 
"martingale component" used to do under the physical probability; this is the reason why this measure 
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is sometimes referred to as the mean- correcting martingale measure, denomination that we will 
adopt. This construction has been widely used in the GARCH context (see [BEK+lll lOrtlO] and 
references therein) where it admits a particularly simple and numerically efficient expression in terms 
of the probability density function of the model innovations. As we will see, in the ARSV case, this 
feature is not anymore available and we will hence work with the expression coming from the predictable 
situation that even though in the ARSV case produces a measure that does not satisfy the Extended 



Girsanov Principle, it is still a martingale measure. Secondly, in Theorem 2.3 we construct the so called 
minimal martingale measure Qmin in the ARSV setup; the importance in our context of this measure 
is given by the fact that the value process of the local risk-minimizing strategy with respect to the physical 
measure for a derivative product coincides with its arbitrage free price when using Qmin as a pricing 
kernel. A concern with Qmin in the ARSV setup is that this measure is in general signed; fortunately, 
the occurrence of negative Radon-Nikodym derivatives is extremely rare for the usual parameter values 
that one encounters in financial time series. Consequently, the bias introduced by censoring paths that 
yield negative Radon-Nikodym derivatives and using Qmin as a well-defined positive measure is hardly 
noticeable. A point that is worth emphasizing is that even though the value processes obtained when 
carrying out local risk minimization with respect to the physical and the minimal martingale measures 
are identical, the hedges are in general not the same and consequently so are the hedging errors; this 



difference is studied in Proposition 3.6 



A general remark about local risk minimization that supports its choice in applications is its adapt- 
ability to difTerent hedging frequencies. Most sensitivity based (delta) hedging methods for time 
series type underlyings are constructed by discretizing a continuous time hedging argument; conse- 
quently, when the hedging frequency diminishes, the use of such techniques loses pertinence. As we 



explain in Section 3.3 the local risk minimization hedging scheme can be adapted to prescribed changes 
in the hedging frequency, which regularly happens in real life applications. 

The paper is organized in four sections. Section [2] contains a brief introduction to ARSV models and 
their dynamical features of interest to our study; this section contains two subsections that explain the 
volatility estimation techniques that we will be using and the martingale measures that we mentioned 
above. The details about the implementation of the local risk minimization strategy are contained in 
Section [Sj Section [4] contains a numerical study where we carry out a comparison between the hedging 
performances obtained by implementing the local risk minimization scheme using the different volatility 
estimation techniques and the martingale measures introduced in Section [2j In order to enrich the 
discussion, we have added to the study two standard sensitivity based hedging methods that provide good 
results in other contexts, namely Black-Scholes |BS72) and an adaptation of Duan's static hedge |Dua95) 
to the ARSV context. As a summary of the results obtained in that section it can be said that local 
risk minimization outperforms sensitivity based hedging methods, especially when maturities are long 
and the hedging frequencies are low; this could be expected due to the adaptability feature of local risk 
minimization with respect to modifications in this variable. As to the different measures proposed, it 
is the minimal martingale measure that yields the best hedging performances when implemented using a 
Kalman based estimation of volatilities, in the case of short and medium maturities and h-likelihood for 
longer maturities and low hedging frequencies. 

Conventions and notations: The proofs of all the results in the paper are contained in the appendix 
in Section |6] Given a filtered probability space {fl,P,J^,{J-t}teN) and X,Y two random variables, we 
will denote by Et[X] :— E[X\Tt] the conditional expectation, covt{X,Y) :— cov{X,Y\J't) := Et[XY] - 
Et[X]Et[Y] the conditional covariance, and by vart(X) := EtlX"^] — EtlX]"^ the conditional variance. A 
discrete-time stochastic process {ATtlfgpj is predictable when Xt is J-"(_i-measurable, for any t CzN. 

Acknowledgments: we thank Alexandru Badescu for insightful comments about our work that have 
significantly improved its overall quality. 
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2 Auto-regressive stochastic volatility (ARSV) models 

The auto-regressive stochastic volatihty (ARSV) model has been introduced in |Tay86| with the objective 
of capturing some of the most common stylized features observed in the excess returns of financial time 
series: volatility clustering, excess kurtosis, and autodecorrelation in the presence of dependence; this 
last feature can be visualized by noticing that financial log- returns time series exhibit autocorrelation at, 
say lag 1, close to zero while the autocorrelation of the squared returns is significantly not null. Let St be 
the price at time t of the asset under consideration, r the risk-free interest rate, and yt '■= log {St/ St-i) 
the associated log-return. In this paper we will be considering the so-called standard ARSV model 
which is given by the prescription 



yt - r + atet, {ej ~ IIDN(0, 1) 
bt = 7 + #t_i+w;t, ~IIDN(0,a2) 



2^ (2.1) 



where ht := log((T(), 7 is a real parameter, and 4> G (—1,1). Notice that in this model, the volatility 
process {at} is a non-traded stochastic latent variable that, unlike the situation in GARCH like mod- 
els |Eng82[ IBol86| is not a predictable process that can be written as a function of previous returns and 
volatilities. 



It is easy to prove that the unique stationary returns process induced by (2.1) available in the 
presence of the constraint 6 (~1, 1) is a white noise (the returns have no autocorrelation) with finite 
moments of arbitrary order. In particular, the marginal variance and kurtosis are given by 



var(yt) = E[crj] = exp 



7 ^1 



kurtosis {yt) = 3 exp ((t^) (2.2) 



Moreover, let a1 be the marginal variance of the stationary process {fot}, that is. 

It can be shown |Tay86| that whenever is small and/or (j) is close to one then the autocorrelation 
7(/i) of the squared returns at lag h can be approximated by 



l{h) 



exp((7g) - 1 ^ 
3exp(cr2) _ 1 



The existence of finite moments is particularly convenient in the context of quadratic hedging methods. 
For example, in Theorem 3.1 of [OrtlOj it is shown that the finiteness of kurtosis is a sufficient condition 
for the availability of adequate integrability conditions necessary to carry out pricing and hedging via 
local risk minimization. 



Let (il,P) be the probability space where the model (2.1 1 has been formulated and let Ft be the 
information set generated by the observables {S'o, 5*1, . . . , St}. This statement can be mathematically 
coded by setting Tt — o {Sq, 6*1, ... , St), where cr {Sq, Si, . . . ,St) is the sigma algebra generated by the 
prices {Sq, Si, . . . , St}. As several equivalent probability measures will appear in our discussion, we will 
refer to P as the physical or historical probability measure. 

Unless we indicate otherwise, in the rest of the discussion we will not assume that the innovations 
{et} are Gaussian. We will also need the following marginal and conditional cumulant functions of 
et with respect to the filtration T — {J-^t}te{o,....T}- 

Ll{z) = logi?^[e^^'], zeM, 

Kf^{u) = log i?^ [e"'^' I J^f_i] , with w a random variable. 

If the innovations {et} are Gaussian, we obviously have Lf^{z) — z^ jl. When the random variables u 
and et are P-independent, the following relation between and K^^ holds. 
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Lemma 2.1 Let u : {^,P) — > M 6e a random variable independent of et o,nd T = {J^t}te{o,....T} 
filtration introduced above. Then 



the 



Kl{u)= log EP [e^"(") I 



(2.3) 



The cumulant functions that we just introduced are very useful at the time of writing down the condi- 
tional means and variances of price changes with respect to the physical probability; these quantities 
will show up frequently in our developments later on. Before we provide these expressions we introduce 
the following notation for discounted prices: 

St := ^te-'-*, 

where r denotes the risk-free interest rate. The tilde will be used in general for discounted processes. 
Using this notation and Lemma [2. H a straightforward computation using the independence at any given 



(2.4) 
(2.5) 



time step t of the innovations {et} and the volatility process {at} defined by (2.1 ), shows that 



var 



St 
St 



St-i I J't-1 
St-1 I J' t-1 



St- 



(e<(-)-l) 



St-iE' 



— '-'t-1 



2.1 Volatility and model estimation 

The main complication in the estimation of ARSV models is due to the fact that Ft does not determine 
the random variable at and hence makes impossible the writing of a likelihood function in a tradi- 
tional sense. Many procedures have been developed over the years to go around this difficulty based 
on different techniques: moment matching |Tay86| , generalized method of moments |MT90[ IJPR941 
IAS96] , combinations of quasi- maximum likelihood with the Kalman filter |HRS94j , simulated maximum 
likelihood jDan94| . MCMC |JPR94| and, more recently, hierarchical-likelihood |LN961 iLNOej ldCL08| 
IWLLdCTT] (abbreviated in what follows as h-likelihood) . An excellent overview of some of these meth- 
ods is provided in Chapter 11 of the monograph |Tay05| . 

In this paper we will focus on the Kalman and h-likelihood approaches since both are based on 
numerically efficient estimations of the volatility, a point of much importance in our developments. 

State space representation and Kalman filtering. This approach is advocated by |HRS94j and 



consists of writing down the model (2.1) using a state space representation by setting 



h = log {\yt - r\) , Lt = log {at) . 



We assume in this paragraph that the innovations {et} are Gaussian. Using these variables, (2.1) can 
be rephrased as the observation and state equations: 

h = Lt+Ct, 6=log(|et|), ,2 6) 

[Lt-aK] = (j){Lt-i - ax) + Vt, rjt = \wt, 



where aK = E[Lt] = ^-^[fot] — 2{i-(t,) ■ Kalman filtering cannot be directly applied to (2.6) due to the 
non-Gaussian character of the innovations = log(|et|) hence, the approximation in [HRS94J consists 
of using this procedure by thinking of as a Gaussian variable with mean /ij :— E[S^t] — —0.63518 and 
variance (t| — Var[^t] = 7r^/8. The Kalman filter provides an algorithm to iteratively produce one-step 
ahead linear forecasts It^i := P[lt\J-t-i\] given that It = \og{at) -\- log(|e(|), these forecasts can be used 
to produce predictable estimations a^ of the volatilities at by setting log (cr^) := h-i.i — and hence 

erf = exp(?t_i,i - ^^). (2.7) 
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The /i-likelihood approach. This method |LN961 ILN061 IdCLOSl IWLLdCll| consists of carrying out 
a Hkehhood estimation while considering the volatihties as unobserved parameters that are part of the 
optimization problem. More specifically, let a := (7, </>, a^) be the parameters vector, b = (61, ... , bx) 
the vector that contains the unobserved bt := log((Tj ), and let Zt := yt — r; if f{zt\bt) and f{b\a) denote 
the corresponding probability density functions, the associated ft,- (log) likelihood is defined as 

T 

h{z;b,a) = Y.^og f{zt\bt)+ log f{b\a) 
t=i 

T 

(2.8) 



I Jl {^t exp(-6t) + bt + ^{bt-j- cbbt-if + logal^j 



The references listed above provide numerically efficient procedures to maximize (2.8) with respect to 
the variables b and a once the sample z has been specified. An idea of much importance in our context 
is that, once the coefficients a have been estimated, this procedure has a natural one-step ahead forecast 
of the volatility associated, similar to the one obtained using the Kalman filter. The technique operates 
using the variables bt ■= log((T|) alternating prediction and updating; more specifically, we initialize the 



algorithm using the mean of the unique stationary solution of (2.1 1, that is, bo := 7/(1 — 0). We use 



the second equation in (2.1 1 to produce a linear prediction bip of bi based on the value of 60, namely 
bip = 7-1- 4>bQ. Once the quote zi is available, the h- likelihood approach can be used to update this 
forecast to the value 6i„ by solving the optimization problem 



biu = argmin zf exp(-6i) + 61 H -(61 - 7 - ^^-^^ 



1 

This value can be used in its turn to produce a forecast b2p for 62 by setting b2p = 7 + 4>biu- If we 
continue this pattern we obtain a succession of prediction/updating steps given by the expressions 

hp = 7 + 0&(t-i)«: 

btu = argmin (ztexp(-6t) + 5i + ^(6i - 7 - 06(t_i)„)^ 



btei 



These forecasts can be used to produce predictable estimations o-f of the volatilities at by setting 



1. 



at := exp \^-btp 

Remark 2.2 Even though both the Kalman and the /i-likelihood methods constitute adequate estima- 
tion and volatility forecasting methods, the /i-likelihood technique has a much wider range of applicabil- 
ity for it is not subjected to the rigidity of the state space representation and hence can be generalized 
to stochastic volatility models with more complex link functions than the second expression in ( |2.1[ ), to 
situations where the innovations are non-Gaussian, or there is a dependence between {e^} and {wt}. 



2.2 Equivalent martingale measures for stochastic volatility models 

Any technique for the pricing and hedging of options based on no-arbitrage theory requires the use of 
an equivalent measure for the probability space used for the modeling of the underlying asset under 
which its discounted prices are martingales with respect to the filtration generated by the observables. 
Measures with this property are usually referred to as risk-ncutral or simply martingale measures and 
the Radon-Nikodym derivative that links this measure with the historical or physical one is called a 
pricing kernel. 
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A number of martingale measures have been formulated in the literature in the context of GARCH- 
like time discrete processes with predictable conditional volatility; see iBEK+ll for a good comparative 
account of many of them. Those constructions do not generalize to the SV context mainly due to the 
fact that the volatility process {at} is not uniquely determined by the price process {St} and hence it 
is not predictable with respect to the filtration J" = {J^t} generated by {St}. 

In this piece of work we will explore two solutions to this problem that, when applied to option 
pricing and hedging, yields a good combination of theoretical computability and numerical efficiency. 
The first one has to do with the so called minimal martingale measure and the second approach is based 
on an approximation inspired in the Extended Girsanov Principle [EM98 . 

The minimal martingale measure. As we will see in the next section, this measure is particularly 
convenient when using local risk minimization with respect to the physical measure |FS86[ IFSQli IFS02) 
as a pricing/hedging technique, for it provides the necessary tools to interpret the associated value 
process as an arbitrage free price for the contingent product under consideration. 

The minimal martingale measure Qmin is a measure equivalent to P defined by the following property: 
every P-martingale M g L'^{fl,P) that is strongly orthogonal to the discounted price process S, is also 
a Qmin-martingale. The following result spells out the specific form that Qmin takes when it comes to 
the ARSV models. 



Theorem 2.3 Consider the price process S = {Sq, Si, . . . , St} associated to the ARSV model given 
by the expression (2.1). In this setup, the minimal martingale measure is determined by the Radon- 
Nikodym derivative dQmin/dP that is obtained by evaluating at time T the P-martingale {^t}tg{i,...,T} 
defined by 



n 

k=l 



1 



(2.9) 



Proof. By Corollaries 10.28 and 10.29 and Theorem 10.30 in |FS02j . the minimal martingale measure, 
when it exists, is unique and is determined by the Radon-Nikodym derivative obtained by evaluating at 
T the P-martingale 



Zt:=\{l + \k (Yu - Yk-i) , where Afc 



fc=i 



Sk — Sk~i 



varfc^i [Sk - Sk-i 



(2.10) 



and Yk is the martingale part in the Doob decomposition of Sk with respect to P. Hence we have 



Yk - Yk-i ^Sk- E[^i 



Sk 



Sk-ile""'''' -e^'f^^""' 



(2.11) 



The equality (2.91 follows from (2.10), (2.111, (2.4), and (2.51 



Remark 2.4 The measure Q„ 



obtained by using (|2.9|) is in general signed. Indeed, as the random 

' can take arbi- 



variable cr/jefc is J^/j-adapted and Kf^{ak) is J^fc-predictable, the term 
trarily negative values that can force Zt to become negative. We will see in our numerical experiments 
that even though this is in general the case, negative occurrences are extremely unlikely for the usual 
parameter values that one encounters in financial time series. Consequently, the bias introduced by cen- 
soring paths that yield negative Radon-Nikodym derivatives and using Qmin as a well-defined positive 
measure is not noticeable. 
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Remark 2.5 The quantity Kf^{(Tt) = \ogE^ 



Ft~i] in (2.9) is in general extremely difficult to 



compute. Lemma 2.1 allows us to rewrite it as a conditional expectation of a functio n tha t depends only 

to approximate 



2.1 



on at and hence it is reasonable to use the estimations ai and CTj' introduced in Section 
Kf^{(7t). For example, if the innovations {ct} are Gaussian and we hence have Lf^(z) — we will 

approximate 

<(aO=log<i 

by ((7^)^/2 or ((t^)^/2 depending on the technique (Kalman or ^.-likelihood, respectively) used to esti- 
mate the conditional volatility. 



The Extended Girsanov Principle and the mean correcting martingale measure. This con- 
struction has been introduced in |EM98j as an extension in discrete time and for multivariate processes 
of the classical Girsanov Theorem. This measure is designed so that when the process is considered with 
respect to it, its dynamical behavior coincides with that of its martingale component under the original 
historical probability measure. These martingale measures are widely used in the GARCH context (see 
for example [BEK+llI lOrtlO] and references therein) . 



In this paragraph we will work with models that are slightly more general than (2.1 1 in the sense 



that we will allow for predictable trend terms and generalized innovations, that is, our ARSV model 
will take the form 

yt = mt + atet, {ej - IID(0, 1) 
bt = j + 4>bt-i+wt, {wt} ^llB{0,al) 

with mt an -measurable random variable. 



(2.12) 



Theorem 2.6 Given the model specified by (2.12), let f^^^^ be the conditional probability density func- 
tion under P of the random variable attt given J-t-i and let {Nt} be the stochastic discount factors 
defined by: 



Nt 



/i:,^ jatet + m« - r + log Ef_, [e^^^^]) 

fateMtet) 



The process Zt :— Y[k=iNk, is a {Tt, P) -martingale such that E^[Zt] — 1 and Zt defines in the 
ARSV setup the martingale measure Qegp associated to Extended Girsanov Principle via the identity 
Zt = dQEGp/dP. 

Remark 2.7 As at and ej are independent processes, Rohatgi's formula yields: 

1 



Unfortunately, there is no closed form expression for this integral even in the Gaussian setup. Though 
ad hoc numerical have been developed to treat it (see for instance |GLD04j ). the computation of the 
stochastic discount factors Nt for each time step t may prove to be a computationally heavy task. 

The point that we just raised in the remark leads us to introduce yet another martingale measure 
that mimics the good analytical properties of the Extended Girsanov Principle in the GARCH case but 
that, in the ARSV setup, only shares approximately its other defining features. In order to present this 
construction we define the market price of risk process: 



Pt 



mt + K^(at) - r 



(2.13) 



Hedging of time discrete auto-regressive stochastic volatility options 



9 



to which we can associate what we will call the stochastic discount factor: 

.r ( ^ ft{<^t+Pt) 

Nt[et,pt) = — .p, - , 
/t (et) 

where ff is the conditional probability density function of the innovations {et} with respect to the 
measure P given Tt-^i- 

Theorem 2.8 Using the notation that we just introduced, we have that 

(i) The process Zt :~ 111=1 ^k, "is a {J-'t, P) -martingale such that E^[Zt] = 1. 

(ii) Zx defines an equivalent measure Qmc such that Zj- — dQ-^^^jdP under which the discounted price 

process {Sq, Si, . . . ^ St} determined by {2.12) is a martingale. By an abuse of language we will 
refer to Q^c as the j4i?S'V" mean correcting martingale measure. 



3 Local risk minimization for ARSV options 

In this section we use the local risk minimization pricing/hedging technique developed by FoUmer, 
Schweizer, and Sondermann (see |FS86[ IFS91[ ISchOlj . and references therein) as well as the different 
measures and volatility estimation techniques introduced in the previous section to come up with prices 
and hedging ratios for European options that have an ARSV process as model for the underlying asset. 

As we will see, this technique provides simultaneous expressions for prices and hedging ratios that, 
even though require Monte Carlo computations in most cases, admit convenient interpretations based on 
the notion of hedging error minimization that can be adapted to various models for the underlying asset. 
This hedging approach has been studied in [OrtlOj in the context of GARCH models with Gaussian 
innovations and in |B011| for more general innovations. Additionally, the technique can be tuned in 
order to accommodate different prescribed hedging frequencies and hence adapts very well to realistic 
situations encountered by practitioners. In the negative side, like any other quadratic method, local 
risk minimization penalizes equally shortfall and windfall hedging errors, which may sometimes lead to 
inadequate hedging decisions. 

3.1 Generalized trading strategies and local risk minimization 

We now briefly review the necessary concepts on pricing by local risk minimization that we will needed 
in the sequel. The reader is encouraged to check with Chapter 10 of the excellent monograph |FS02j for 
a self-contained and comprehensive presentation of the subject. 

As we have done so far, we will denote by St the price of the underlying asset at time t. The symbol 
rt denotes the continuously composed risk-free interest rate paid on the currency of the underlying in 
the period that goes from time t — 1 to t; we will assume that {rt} is a predictable process. Denote by 

t 

Rt 

3=0 

The price at time t of the riskless asset S*" such that = 1, is given by 5° — e^*. 

Let now H{St) be a European contingent claim that depends on the terminal value of the risky asset 
St. In the context of an incomplete market, it will be in general impossible to replicate the payoff H by 
using a self-financing portfolio. Therefore, we introduce the notion of generalized trading strategy, 
in which the possibility of additional investment in the riskless asset throughout the trading periods 
up to expiry time T is allowed. All the following statements are made with respect to a fixed filtered 
probability space {Q, P, , {J^t}te{o,...,T})- 
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• A generalized trading strategy is a pair of stochastic processes (C^jC) such that {Ct'}tg{o,...,T} 
is adapted and {^t}te{i....,T} is predictable. The associated value process V of is defined 
as 

Vo-.^e,, and Vt := ^'^ ■ + ■ St, t>l. 

• The gains process G of the generahzed trading strategy is given by 

t 

Go:=0 and Gt ■.= Y,^k ■ {Sk - Sk-i), t = 0,...,T. 
fc=i 

• The cost process G is defined by the difference 

Gt:=Vt-Gu t = 0,...,T. 

These processes have discounted versions Vt, Gt, and Ct defined as: 



Vt 



Gt :=^a-(5fc-^fc-i), and Gt := Vt - Gt 



fe=i 



Assume now that both H and the {Sn}ne{o,...,T} are in L^(ri,P). A generahzed trading strategy is 
called admissible for H whenever it is in L'^{fl, P) and its associated value process is such that 

Vt = H, P a.s., Vt, Gt G L^ifl, P), for each t. 

The hedging technique via local risk minimization consists of finding the strategies ^) that minimize 
the local risk process 



Rt{e,0 ■=E'' {Ct+i-Ctf\Ft 



i = 0,...,T-1, 



(3.1) 



within the set of admissible strategies (C^iO- More specifically, the admissible strategy is called 

local risk-minimizing if 

for all t and each admissible strategy It can be shown that |FS021 Theorem 10.9] an admissible 

strategy is local risk-minimizing if and only if the discounted cost process is a P-martingale and it 
is strongly orthogonal to S, in the sense that covt(S'f+i — St,Ct+i — Ct) = 0, P-a.s., for any t = 
0, . . . ,T — 1. More explicitly, whenever a local risk-minimizing technique exists, it is fully determined 
by the backwards recursions: 



Vt = H, 

coyP iVt+uSt+i- St \ Ft) 



t+i 



vaiP{St+i - St I Ft) 



Vt = E' 



Vt+i I Ft 



'(St+i~St) 



Tt 



(3.2) 
(3.3) 

(3.4) 



The initial investment Vq determined by these relations will be referred to as the local risk minimiza- 
tion option price associated to the measure P. 
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3.2 Local risk minimization with respect to a martingale measure 



As we have explicitly indicated in expressions ( |3.2[ )-( |3.4[ ), the option values and the hedging ratios that 
they provide depend on a previously chosen probability measure and a filtration. In the absence of 
external sources of information, the natural filtration {J-^t} to be used is the one generated by the price 
process. Regarding the choice of the probability measure, the first candidate that should be considered 
is the physical or historical probability associated to the price process due to the fact that from a risk 
management perspective this is the natural measure that should be used in order to construct the local 



risk (3.11 



In practice, the use of the physical probability encounters two major difficulties: on one hand. 



when (3.2)-(3.4) are written down with respect to this measure, the resulting expressions are convoluted 



and numerically difficult to estimate due to the high variance of the associated Monte Carlo estimators; 
a hint of this difficulty in the GARCH situation can be seen in Proposition 2.5 of |OrtlO| . Secondly, 
unless the discounted prices are martingales with respect to the physical probability measure or there 
is a minimal martingale measure available, the option prices that result from this technique cannot be 
interpreted as arbitrage free prices. 

These reasons lead us to explore the local risk minimization strategy for martingale measures and, 
more specifically for the martingale measures introduced in Section [2.2[ Given that the trend terms that 
separate the physical measure from being a martingale measure are usually very small when dealing with 
daily or weekly financial returns, it is expected that the inaccuracy committed by using a conveniently 
chosen martingale measure will be smaller than the numerical error that we would face in the Monte 
Carlo evaluation of the expressions corresponding to the physical measure; see |OrtlO[ Proposition 3.2] 
for an argument in this direction in the GARCH context. In any case, the next section will be dedicated 
to carry out an empirical comparison between the hedging performances obtained using the different 
measures in Section |2.2[ as well as the various volatility estimation techniques that we described and 
that are necessary to use them. 



The following proposition is proved using a straightforward recursive argument in expressions (3.2 1 



(3.4 1 combined with the martingale hypothesis in its statement. 



Proposition 3.1 Let Q be an equivalent martingale measure for the price process {St} and H(St) be 
a European contingent claim that depends on the terminal value of the risky asset St ■ The local risk 
minimizing strategy with respect to the measure Q is determined by the recursions: 



whe 



Vt 

:=varQ(5t+i-5J 



e-iRT+R.)H{ST) {St+ie-^'+' - St) 



H{Sj 



(3.5) 
(3.6) 

(3.7) 



-2Rt 



[S\ 



e^^^'var'^ 



(5t+ie-'-*+i I Tt). (3.8) 



Remark 3.2 When expression (3.7 1 is evaluated at i = it yields the initial investment Vb necessary 



to setup the generalized local risk minimizing trading strategy that fully replicates the derivative H and 
coincides with the arbitrage free price for H that results from using Q as a pricing measure. Obviously, 
this connection only holds when local risk minimization is carried out with respect to a martingale 
measure. 



Remark 3.3 Local risk-minimizing trading strategies computed with respect to a martingale mea- 
sure Q also minimize |FS021 Proposition 10.34] the so called remaining conditional risk, defined as 
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the process i?^ (^°,^) := Et[{CT — Ct)'^], t — 0, . . . ,T; this is in general not true outside the martin- 
gale framework (see [SchOll Proposition 3.1] for a counterexample). Analogously, local risk minimizing 



strategies are also variance-optimal, that is, they minimize 



H-Vo~Gq 



(see [FS02i Propo- 



sition 10.37]). This is particularly relevant in the ARSV context in which a standard sufficient condition 
(see |FS021 Theorem 10.40]) that guarantees that local risk minimization with respect to the physi- 
cal measure implies variance optimality does not hold; we recall that this condition demands for the 
deterministic character of the quotient 



var^i [St - St-i] 



Indeed, expressions (2.4 1 and (2.5), together with Lemma 2.3 imply that in our situation 



e " 



which is obviously not deterministic. 



Remark 3.4 The last equality in (3.8 1 is of much importance when using Monte Carlo simulations to 



evaluate (3.5)-(3.7) and suggests the correct estimator that must be used in practice: if we generate 



under the measure Q a set of N price paths all of which start at St at time t and take values S^^^ , ■ ■ ■ , S^^ 
at time t + I, then in view of the last equality in (3.8) we write 



e-2«*var'?(5't+ie^'''+i | J't 



St+i^ 



i=l 



-rt+l 



St 



We warn the reader that an estimator for S^^^ based on the Monte Carlo evaluation of 

e-'^^E?[Sl,e-'^-^^^-Sf] 



would have much more variance and once inserted in the denominator of (3.6) would produce unaccept- 
able results. 



Remark 3.5 The Monte Carlo estimation of the expressions (3.5)-(3.7) requires the generation of 



price paths with respect to the martingale measure Q. In the GARCH context this can be easily carried 
out for a variety of pricing kernels by rewriting the process in connection with the new equivalent 
measure in terms of new innovations (see for example |Dua95[ lOrtlOi IGCIlOi IBEK+Il] ) . This method 
can be combined with modified Monte Carlo estimators that enforce the martingale condition and 
that have a beneficial variance reduction effect like, for example, the Empirical Martingale Simulation 
technique introduced in |DS98l IDGSOl] . Unfortunately, this is difficult to carry out in our context and, 
more importantly, it does not produce good numerical results. Given that all the pricing measures 
introduced in the previous section are constructed out of the physical one, the best option in our setup 
consists of carrying out the path simulation with respect to the physical measure and computing the 



expectations in (3.5)-(3.7) using the corresponding Radon-Nikodym derivative. More specifically, let Q 



be an equivalent martingale measure that is obtained out of the physical measure P by constructing a 
Radon-Nikodym derivative of the form: 

dQ 



TT M 
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such that the process {Zt} defined by Zt :— 111=1 is a P-martingale that satisfies [Zt] = 1; notice 
that all the measures introduced in the previous section are of that form. In that setup we define the 
process {Ft} by Ft := H^t ^^^^ allows us to rewrite Q-expectations in terms of P-expectations, that 
is, for any J-V measurable function / and any t e {0, . . . , T}, 

eQ [/ I Tt] = EP [Ft+^f I Tt] ■ 

The proof of this equality is a straightforward consequence of |FS021 Proposition A. 11] and of the specific 
way in which the measure Q is constructed. 



3.3 Local risk minimization and changes in the hedging frequency 

As we already pointed out one of the major advantages of the local risk minimization hedging scheme 
when compared to other sensitivity based methods, is its adaptability to prescribed changes in the 
hedging frequency. Indeed, suppose that the life of the option H with maturity in T time steps is 
partitioned into identical time intervals of duration j; this assumption implies the existence of an 
integer k such that kj — T. 

We now want to set up a local risk minimizing replication strategy for H in which hedging is carried 
out once every j time steps. We will denote by S^t+j the hedging ratio at time t that presupposes that 
the next hedging will take place at time t + j. The value of such ratios will be obtained by minimizing 
the j-spaced local risk process: 



t+j 



cif I :Ft 



t = o,j,2j,...,(fc-i)j = r-j, 



where {Cf} is a cost process constructed out of value and gains processes, {V/} and 1^(1 that only 
take into account the prices of the underlying assets at time steps t — 0, j, 2j, kj — T , in particular, 
given an integer I such that t = Ij 

I 

G{ := ^^rj ■ [Srj - S(^r-l)j)- 



A straightforward modification of the argument in [ FS021 Theorem 10.9] proves that the solution of this 
local risk minimization problem with modified hedging frequency with respect to a martingale measure 
is given by the expressions: 



_^^^ Ef[H{ST){St^je 



-{Rt+j-Rt 



St)] 



= e: 



<RT-Rt)Hi^ST) 



-2{Rt+j-Rt 



(3.9) 
(3.10) 

(3.11) 



for any t = 0, j, 2j, . . . , (fc — l)j — T ~ j. As a follow up to what we pointed out in the remark 3.4 the 



denominator in (3.10) should be computed by first noticing that 



E^ 



-Rt) 



St 



var, 



5, 



-{Rt 



-Rt 



and then using the appropriate Monte Carlo estimator for the variance, that is, 

N 



var'3(5t+ie-'-'+M -^t) ^ ^ E (^tV,t 



-(Rl+^-Rt) 



(3.12) 
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where Sl_^_j, . . . , S{^j are A'' realizations of the price process under Q at time t + j, using paths that have 
all the same origin St at time t. If the interest rate process {rt} is constant and equal to r, then (3.12) 
obviously reduces to 

varQ(5,+ie-'-'+^ I -^O ^ ^ E 'St)'. 



3.4 Local risk minimization and the minimal martingale measure 

Local risk-minimization requires picking a particular probability measure in the problem. We also saw 
that it is only when discounted prices are martingales with respect to that measure that the obtained 
value process for the option in question coincides with the arbitrage free price that one would obtain by 
using that martingale measure as a pricing kernel. This observation particularly concerns the physical 
probability which one would take as the first candidate to use this technique since it is the natural 
measure to be used when quantifying the local risk. 

The solution of this problem is one of the motivations for the introduction of the minimal martingale 



measure Qmin that we defined in Section 2.2 Indeed, it can be proved that (see Theorem 10.22 in |FS02] ) 
the value process of the local risk-minimizing strategy with respect to the physical measure coincides with 
arbitrage free price for H obtained by using Qmin as a pricing kernel. More specifically, if is the 



value process for H obtained out of formulas (3.2)-(3.4) using the physical measure then 



e-iRT-Rt)jj^S^^ (3 13) 



We emphasize that, as we pointed out in Remark 2.4 the minimal martingale measure in the ARSV 
setup is in general signed. Nevertheless, the occurrences of negative Radon-Nikodym derivatives are 
extremely unlikely, at least when dealing with Gaussian innovations, which justifies its use for local risk 
minimization. 

We also underline that even though the value processes obtained when carrying out local risk mini- 
mization with respect to the physical and the minimal martingale measures are identical, the hedges are 
in general not the same and consequently so are the hedging errors. In the next proposition we provide 
a relation that links both hedges and identify situations in which they coincide. Before we proceed 
we need to introduce the notion of global (hedging) risk process: it can be proved that once a 
probability measure P has been fixed, if there exists a local risk-minimizing strategy (^'^, ^) with respect 
to it, then it is unique (see |FS021 Proposition 10.9]) and the discounted payoff H can be decomposed 
as (see [FSHl Corollary 10.14]) 

H = Vo + Gt + Lt, (3.14) 

with Gt the discounted gains process associated to ($",0 and Lt :— Ct — Co, i = 0, . . . ,T a sequence 
{Lt}t<£{o....,T} that we will call the (discounted) global (hedging) risk process. {it}te{o,....T} is a square 
integrable P-martingale that satisfies Lq = and that is strongly orthogonal to S in the sense that 

cov''{iLt+,^Lt){St+i-St)\Tt)^0 for any t = 0,...,T-l. 



The decomposition (3.141 shows that Lt measures how far H is from the terminal value of the self- 
financing portfolio uniquely determined by the initial investment Vq and the trading strategy given by 
{Ct} (sec [LL08, Proposition 1.1.3]). 

Proposition 3.6 Let H(St) be a European contingent claim that depends on the terminal value of 
the risky asset St and let {^f""°}J=i o,nd {^f he the local risk minimizing hedges associated to 



Hedging of time discrete auto-regressive stochastic volatility options 



15 



the minimal martingale measure Qmin o,nd the physical measure P, respectively. Let \ Lf \ he the 

L J t=o 

associated P -global risk process. Then, for any i G {1, . . . , T}; 









vaifj 


i" 




St-i 





(3.15) 



// the processes \ Lf > and ISt} are either Qmin -strongly orthogonal or Qmin -independent, then 
{Ct """}t=i = }t=i- 



4 Empirical study 

The goal of this section is comparing the hedging performances obtained by implementing the local risk 
minimization scheme using the different volatility estimation techniques and martingale measures intro- 
duced in Section[2] In order to enrich the discussion, we will add to the list two standard sensitivity based 
hedging methods that provide good results in other contexts, namely standard Black-Scholes |BS72j and 
an adaptation of Duan's static hedge |Dua95) to the ARSV context. 

The way in which we proceed with the comparison consists of taking a particular ARSV model as 
price paths generating process and then carrying out the replication of a European call option according 
to the prescribed list of hedging methods for various moneyness, strikes and hedging frequencies. At 
the end of the life of each of this option, the square hedging error is recorded. This task is carried out 
for a number of independently generated price paths which allows us to estimate a mean square hedging 
error that we will use to compare the different hedging techniques. 

The price generating ARSV process. The price process chosen for our simulations is obtained by 
taking the model that has as associated log-returns one of the ARSV prescriptions studied in [WLLdCTT] . 



It consists of taking the following parameters in (2.1 1 



0.1/252, 7 = -0.821, (^ = 0.9, cr„ = 0.675. 



The use of formulas (2.2) shows that the resulting log-returns are very leptokurtic (kurtosis = 33.00) 
and volatile (the stationary variance is 9.01 • 10~^ which amounts to an annualized volatility of 47%). 
The intentionality behind this choice is exacerbating the defects of the different hedging methods in 
order to better compare them. 

The hedging methods. Given the price paths generated by the ARSV process detailed in the previous 
paragraph, we will carry out the replication of European call options that have them as underlying asset 
using local risk minimization with respect to the two martingale measures introduced in Section |2.2[ 
namely: 



ARSV mean correcting martingale measure introduced in Theorem |2.8| The hedging 
method is abbreviated as LRM-MCMM in the figures. 



Minimal martingale measure. Denoted by Qmin and introduced in Theorem [23] The hedging 
method is abbreviated as LRM-MMM in the figures. This measure may be signed in the ARSV 



context but this happens with extremely low probability (see Remark 2.4); in our simulations we 
did not encounter this situation even a single time. Local risk minimization with respect to this 
measure yields the same value process as the physical probability but not the same hedging ratios; 
Proposition |3.6| spells out the difference and provides sufficient conditions for them to coincide. 
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The construction of all these measures requires conditional expectations of functions of the conditional 
variance of the log-returns with respect to the filtration generated by the price process. As this filtration 
does not determine the conditional volatilities, those expectations cannot be easily computed; we hence 
proceed by estimating the conditional volatilities via Kalman filtering and a prediction-update routine 
based on the /i- likelihood approach, both briefly described in Section [2?T| Different volatility estimation 
techniques lead to different pricing/hedging kernels and hence to different hedging performances that 
we compare in our simulations. 

We will complete the comparison exercise with two more methods: 

• Black-Scholes: we forget that the price process is generated via an ARSV model and we handle 
the hedging using the standard Black-Scholes delta BS72) as if the underlying was a realization of 
a log-normal process with constant drift and volatility given by the drift and marginal volatility 



(given by the expression (2.2)) of the ARSV model specified above. 



• Duan's static delta hedge: this is a generalization of the Black-Scholes delta hedge that has 
been introduced in ,Dua95. Corollary 2.4] in the GARCH context. In that result the author 
computes the derivative of a European call price (the delta put can be readily obtained out of the 
call-put parity) with respect to the price of the underlying in an arbitrary incomplete situation 
in which that option price is simply obtained as the expectation of the discounted payoff with 
respect to a given pricing kernel Q, that is: 



Vt = E^ 



-«')ff(5T) I J't 



In those circumstances: 



fDuan 9Vt _ -(RT-Rt)-pQ 



In our empirical study we will use this hedging technique using the various pricing kernels previ- 
ously introduced. A dynamical refinement of this hedge can be obtained by considering additional 
dependences of the value process on the underlying price that may occur (like in our case) through 
the volatility process |GR99[ IB011| . This will not be considered in our study. 

Numerical specifications. The comparisons in performance will be carried out by hedging batches 
of European call options with different maturities and three different moneyness Sq/K of 1.11, 1, and 
0.90, and for a number of independent price paths. The initial price Sq of the underlying is always equal 
to 100. We will assign to each hedging experiment a prescribed hedging frequency. The mean square 
hedging error associated to each individual hedging method will be computed by taking the mean of 
the terminal square hedging errors for each of the price paths. In the particular case of the local risk 



minimization method, where the hedges are obtained by computing the expressions (3.9 1-(3.11 ), the 
conditional expectations that they contain will be estimated via Monte Carlo evaluations based on the 
use of 2500 price paths. 

The hedging exercises. We now explain in detail the different hedging experiments that we have 
carried out. 

• Exercise 1: all the hedging techniques explained in the paper are used. The four maturities 
considered are 6, 8, 10, and 12 time steps. Hedging is carried out daily. The computation of the 
mean square hedging error is based on 1000 price paths. The first thing that we notice is that, even 
for these short maturities, the performance of Duan's static hedge is very poor. This is specific to 
the ARSV situation for in the GARCH case this technique provides very competitive results [OrtlOj 
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even in the presence of processes excited by leptokurtic innovations |B011| . Additionally, the 
Black-Scholes hedging scheme provides remarkably good results, specially for in and at the money 
options. Regarding the local risk minimization strategy, the best results are obtained when using a 
combination of a Kalman based estimation of the conditional volatility and the minimal martingale 
measure Qmin- 

• Exercise 2: we have retaken the specifications of Exercise 1, but this time around we have 
eliminated from the list of techniques under examination Duan's static hedge due to its poor 
performance. The four maturities considered are 10, 20, 30, and 40 time steps. Hedging is carried 
out every 10 days. The main difference with the previous exercise is the degradation of the 
performance of the Black-Scholes scheme in comparison with local risk minimization. This is due 
to the fact that, unlike Black-Scholes, local risk minimization admits adjustment to the change 



of hedging frequency by using the formulas (3.9)^(3.111. Regarding local risk minimization it is 



still the combination of the minimal martingale measure with Kalman based estimation for the 
volatilities that performs the best. 

• Exercise 3; we experiment with longer maturities and smaller hedging frequencies. We consider 
maturities of 20, 40, 60, 80, 100, and 120 time steps with hedging carried out every 20 days. The 
computation of the mean square hedging error is based on 600 price paths. The novelty in these 
experiments with respect to the preceding ones is that in some instances the volatility estimation 
via /i-likelihood seems more appropriate than Kalman's method in this setup. 



5 Conclusions 

In this work we have reported on the applicability of the local risk minimization pricing/hedging tech- 
nique in the handling of European options that have an auto-regressive stochastic volatility (ARSV) 
model as their underlying asset. The method has been implemented using the following martingale 
measures: 

• The minimal martingale measure: even though it is in general signed, the probability of occurrence 
of negative Radon-Nikodym derivatives is extremely low and hence can be used to serve our 
purposes. 

• Mean correcting martingale measure: it is a measure that shares the same expression and good 
analytical and numerical properties as the measure obtained in the GARCH context out of the 
Extended Girsanov Principle. The reason to use this proxy instead of the genuine Extended 
Girsanov Principle is that in the ARSV case, this procedure produces a measure that is numerically 
expensive to evaluate. 

An added difficulty when putting this scheme into practice and that is specific to ARSV models, has to 
do with the need of estimating the volatility that, even though in this situation is not determined by the 
observable price process, it is necessary in order to construct the pricing/hedging kernels. Two techniques 
are used to achieve that: Kalman filtering and hierarchical likelihood (/i-likelihood). Both methods are 
adequate volatility estimation techniques, however the /i-likelihood technique has a much wider range of 
applicability for it is not subjected to the rigidity of the state space representation necessary for Kalman 
and hence can be used for stochastic volatility models with complex link functions or when innovations 
are non-Gaussian. 

We have carried out a numerical study to compare the hedging performance of the methods based on 
different measures and volatility estimation techniques. Overall, the most appropriate technique seems 
to be the choice that combines the minimal martingale measure with a Kalman filtering based estimation 
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Figure 4.1: Experiment 1. All the hedging techniques explained in the paper are used. The four maturities considered are 
6, 8, 10, and 12 time steps. Hedging is carried out daily. The computation of the mean square hedging error 
is based on 1000 price paths. 
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Figure 4.2: Experiment 2. The four maturities considered are 10, 20, 30, and 40 time steps. Hedging is carried out every 
10 days. The computation of the mean square hedging error is based on 1000 price paths. 
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Mean square hedging error 




tHH 




Figure 4.3: Experiment 6. The hedging techniques used are spelled out in the legend. Six maturities are considered: 40, 
60, 80, and 100 time steps. Hedging is carried out every 20 days. The computation of the mean square hedging 
error is based on 600 price paths. 
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of the volatility. ft,-likelihood based volatility estimation gains pertinence as maturities become longer 
and the hedging frequency diminishes. 



6 Appendix 



6.1 Proof of Lemma 12.11 

Let Z be an arbitrary J-t-i measurable random variable. As ej is independent of J-t-i and by hypothesis 
it is also independent of u, the joint law Pet.u,z of the random variable (e*, m, Z) : {fl, P) M.^ can be 
written as the product Pet,u.z{x,y, z) — P^^{x)Pu^z{y, z). Hence, using Fubini's Theorem we have 



i?^[e"^'Z] = J J ey-zdP,,{x)dP^^ziy,z)^ J J ey^dP.^ia 
e'^'^^y'^zdP^^z{y,z)^EP [e^^*(«)z . 



zdPu,z{y, z) 



(6.1) 



At the same time, by the definition of Kf^, for any J-t^i measurable random variable Z, the equality (6.1 ) 
implies that: 



[e"''Z] = E' 



As 



is Tt-i measurable and Z arbitrary, the almost sure uniqueness of the conditional expectation 



implies that 



as required. 



The result that we just proved is a particular case of the following Lemma that we state for reference. 
The proof follows the same pattern as in the previous paragraphs. 

Lemma 6.1 Let {Cl,J-,P) be a probability space and B d J- a sub-a -algebra. Let X and Y be two 
random variables such that X is simultaneously independent ofY and B. For any measurable function 
$ : ^ M define the mapping F -.R^Rby F{y) = E y)]. Then, 



Emx,Y)\B]^E[F{Y)\B]. 



6.2 Proof of Theorem 



The proof is a straightforward consequence of Theorem 3.1 in [EM98| . Indeed, it suffices to identify 
in the ARSV case the two main ingredients necessary to carry out the Extended Girsanov Principle, 
namely the one period excess discounted return fit defined by: 



St-i 



K 



with K :— 1/E['_^ [e°'*'^*], and the process {Wt} uniquely determined by the relation: 



Wt := 



St 



St-i 



The Theorem follows from expression (3.8) in [EM98| . 
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6.3 Proof of Theorem [2T8l 



(i) We start by noticing that since et is independent of both pt and Ft-i, by Lemma 6.1 
where the real function F : M M is defined by 



(6.2) 



F{y)^EP meuy)]= j 



f''{x)dx = / + y)dx = / f''{x)dx = 1 



which substituted in (6.2) yields 



Ef^,[Nt{et,pt)] = l. 



(6.3) 



This equality proves immediately the martingale property for {Zf}. Indeed, 

Ef_, [Zt] = E[_, [Nt] Zt_, = 

Finally, as {Zt} is a P-martingale 

EP [Zt] - E^ [Zi] = EP [N,] = 1. 

(ii) Zt is by construction non-negative and E[Zt] = V{Zt > 0) = 1. This guarantees (see, for example. 
Remarks after Theorem 4.2.1 in LL08J) that Qmc is a probability measure equivalent to P. 
We now start proving that Qam is a martingale measure by showing that: 



pQL 



- <i [ 



(6.4) 



Indeed, 



Et-i 



St- 



Zt-i 

1 

'Z^i 



Zt-1 



Ef_, [Zt] 



Zt-i 
1 

Z^i 



' Zt-i 



Now, as et is independent of both <7t and J-t-i, we can use Lemma 6.1 to prove that 
Indeed, we can write 



(6.5) 



(6.6) 



where the function F 



is defined by 



F{y,z) = EP[ey^^Nt{et,z)]= j eyyP{x)^-^^p^dx = j ey^^-'-^f{s)ds 



E' 
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which substituted in (6.6) yields (6.5). Finally, if we insert in (6.5) the explicit expression (2.13) that 
defines the market price of risk, we obtain that 

St 



^f-i 



St^i 



= <i 



L (CTt) 

e ' 



as required. The last equality in the previous expression follows from Lemma |2.1 



6.4 Proof of Proposition 3.6 



Consider the unique decomposition ( 3.14 ) of the discounted payoff H as a sum of the discounted P-gains 
and P-risk processes: 



H = V„ + Y,^k ■{Sk~Sk-i]+L. 

k=l 



(6.7) 



We recall that the minimal martingale measure is characterized by the property that every P-martingale 
that is strongly orthogonal to the discounted price process j is also a Qmin-martingale. Hence, 

as the P-martingale is P-strongly orthogonal to |s'f| , it is therefore a Qmin-martingale. 
L J t=o I- J t=o 

Having this in mind, as well as (3.13), we take conditional expectations E^J^^'" on both sides of the 
decomposition (6.7) and we obtain: 



Vt'' = Vo + 



-Sk-i]+L 



fc=i 



As Vj^ is the local risk minimizing value process for both the P and the Qmin measures, that is V/' — 
yQmiii^ if we multiply both sides of by (^St — ■S'f-i^ and take conditional expectations E^_°^" we have: 



[St - St-i) 



e: 



z 

Vo{St-St-i)] +Y.El"'^ 



fe=i 



Cfe ■ \Sk~ Sk-1 ] (St - s, 



)0 



+E^-^ 



^t i^St — St-i 



(^St — 5'f_i^ 



E. 



t-i 



Lf [St ~ St-i 



(^St-St-i)' 



and we use the relation (3.3) we obtain 



^t-1 


'LfiSt-St-i) 




St — St-i 





If we divide both sides of this equality by iJ^"'" 
that 

as required. Finally, we notice that 

E?_T [(Pf - Pf-i) (St ~ St-i)] = E?_1T [LfiSt - St-i)]-E?_lT [Ll^iSt - St^i)] = Pf™ ffl^t - St-^ 

hence, either the (5,ni„-independence or the orthogonality of |pf I and \ St\ imply that e9jV" LflSt — St-\) 

L J t=o L J t=o L 

and hence U?™}?:i = Ur}r=i- ■ 
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